unicode_categories 0.1.1

Query Unicode category membership for chars
Documentation
  • Coverage
  • 97.3%
    36 out of 37 items documented1 out of 37 items with examples
  • Size
  • Source code size: 409.10 kB This is the summed size of all the files inside the crates.io package for this release.
  • Documentation size: 7.16 MB This is the summed size of all files generated by rustdoc for all configured targets
  • Links
  • Documentation
  • swgillespie/unicode-categories
    17 5 6
  • crates.io
  • Dependencies
  • Versions
  • Owners
  • swgillespie

unicode-categories

unicode-categories is a simple crate that adds many extensions to the char type that allow for the querying of whether or not a character is a member of a certain category of unicode characters.

use unicode_categories::UnicodeCategories;

assert!('a'.is_letter_lowercase());
assert!('A'.is_letter_uppercase());
assert!('\n'.is_other_control());

This crate can be obtained from crates.io and used as a dependency like any other crate.

Regenerating the tables

The script in scripts/unicode.py reads a file named UnicodeData.txt, which is assumed to be in the current working directory. This file can be obtained from ftp://ftp.unicode.org . The script outputs a rust file to standard out, which can be piped to the desired location.

License

Licensed under either of

at your option.

Contribution

Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.